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There has been considerable recent progress in clarifying the conditions under which quantum 
systems equilibrate. We demonstrate that effective equilibration of an isolated system can occur 
under much weaker conditions than previously considered, by requiring only an assumption of suffi- 
ciently complex dynamics (which encodes a condition on the complexity of the system eigenvectors) 
and natural information-theoretic constraints resulting from the infeasibility of collecting an astro- 
nomically large amount of measurement data. In particular, we prove that, for generic Hamiltonian 
systems, the measurement statistics for maximally fine-grained measurements on pure states are 
effectively indistinguishable from the micro-canonical distribution after a finite equilibration time. 
We demonstrate that our assumptions correspond to physically realistic conditions by numerically 
confirming their validity for both generic many-body systems limited to two-body interactions and 
the quantum kicked-top in a globally chaotic regime. 



Introduction. The mechanisms by which the characteris- 
tic features of quantum statistical mechanics emerge from 
an exact treatment of the quantum system are still not 
fully understood. In particular, it is not obvious how (or 
even whether) a pure state evolving according to some 
quantum dynamics approaches some form of equilibrium. 
Recently however, there has been a resurgence of interest 
in this problem [IHZ]> and progress in providing rigorous 
justification for the existence and time-scale of equilibra- 
tion of small subsystems of a larger system . 

Our work extends on this body of knowledge by show- 
ing that an assumption of complex dynamics is sufficient 
for an effective equilibration of measurement statistics 
for isolated quantum systems, without invoking mixed 
states, decoherence, subsystem traces or coarse-graining 
of observables. The form of equilibration we will study 
is an information theoretic one based on well-motivated 
operational constraints on the difficulty of predicting the 
detailed evolution of the system and the infeasibility of 
collecting an astronomically large amount of measure- 
ment data. We show that such equilibration is actually 
generic in the sense that it holds for almost all Hamilto- 
nian systems, where "almost all" is defined with respect 
to the Gaussian Unitary Ensemble (GUE). 

We further show that the strong form of randomness 
associated with GUE models is not necessary for mi- 
crocanonical equilibration by identifying a weaker suf- 
ficient condition on the complexity of the system eigen- 
vectors. We examine numerically whether this sufficient 
condition is met by - and effective information theoretic 
equilibration follows for - two important classes of phys- 
ical systems: (i) generic many-body systems restricted 
to two-body interactions (2-local Hamiltonian systems), 
as well as (ii) single-body, chaotic dynamical systems in 
the macroscopic limit, in particular, the quantum kicked 
top [T7] in a globally chaotic regime. This suggests that 
information-theoretic equilibration is not just mathemat- 
ically generic, with respect to GUE, but may also be 
generic for more physically realistic models. 



Our approach follows an established tradition of mod- 
elling complex physical systems with an appropriate en- 
semble of random matrices, a widely succesful approach 
that has been applied to many-body systems such as 
heavy nuclei and mesoscopic structures, as well as few- 
body but high-dimensional quantum systems associated 
with classically chaotic counterparts [T71 [TB] . This work 
develops and extends related ideas about the relevance of 
the randomness of the system's eigenvector statistics as 
a mechanism for microcanonical equilibration proposed 
earlier by Peres [15] and Srednicki [301 [H] and gives a 
theoretical explanation of the numerical equilibration re- 
sults in Refs. [HI [23]. 

Setup. Consider a dynamically isolated quantum system 
described by a Hilbert space H = C D , and which is in 
an initial state p . Further, let H be a fixed Hamilto- 
nian describing the reversible evolution of the system, 
p(t) = U{t)p U{t)\ where U{t) = e - ltH (taking h = 1). 
A quantum system is said to equilibrate if its state p(t) 
evolves towards some particular mixed state <r eq and re- 
mains close to it (i.e., remains indistinguishable from it 
in some practical sense [HE]) for almost all times. If 
there is such an equilibrium state, it must be the infi- 
nite time average state (sometimes called the generalized 
Gibbs state) a^, :— lim^oo ~ C p{t)dt. Note that, with- 
out further assumptions, may differ from the micro- 
canonical state a mc := 1/D or any thermal state [4|. 
The above form of equilibration has only been rigorously 
proven for the states of sufficiently small subsystems of 
the full system, and depends on the generic nature of ran- 
dom quantum states and their entanglement properties 
[51 15HT6], Of course, finite dimensional closed quantum 
systems cannot equilibrate in this strong sense, as any 
finite dimensional unitary evolution is quasi-periodic. 

In the following we instead consider a mathemati- 
cally weaker but operationally well-motivated form of 
equilibration for the statistics of global observables. To 
this end, we consider any maximally fine-grained (non- 
degenerate) global observable A acting on H, with spec- 
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tral decomposition A = J2k=i a kPk and rank-one pro- 
jectors Pfc. The primitive quantities of interest are the 
probabilities over outcome labels k, 



Pr (k\p(t)) = TV P k U(t)p U(tf 



(1) 



Clearly, if we can show equilibration occurs even for such 
maximal observables then equilibration will occur also 
for more the kinds of coarse-grained observables, which 
provide even less information per observation. 

Because we are interested in the dynamical properties 
of a probability distribution, we use an information the- 
oretic criterion for equilibration based on the difficulty of 
distinguishing between the precise quantum distribution 
Pr (k\p(t)) and the micro-canonical distribution. 

Definition 1. A state p(t) undergoing unitary dynamics 
U(t) on H = C D equilibrates information theoretically 
with respect to an observable A, after a time t eq , if for al- 
most all times t>t oq the outcome distribution Pr (k\p{t)) 
cannot be distinguished from the micro- canonical distri- 
bution Pr mc (/c) with high probability with computational 
resources, including the number of samples taken from 
Pr(fc|p(£)), that scale at most as 0(polylog(_D)). 

This definition captures the idea that although the 
exact quantum distribution for the system may be in 
principle distinguishable from the micro-canonical dis- 
tribution, the two are effectively indistinguishable if the 
resources needed to distinguish them exceed those prac- 
tically available. We choose to delineate the practical 
from the impractical by requiring that the resources, 
and in particular the number of measurement outcomes, 
by disallowing resources that grow faster than poly- 
logarithmically with the Hilbert space dimension, be- 
cause for many-body systems D scales exponentially with 
the number of subsystems. Of course, for a different 
physical scenario, a different cut-off may be appropri- 
ate. The important point is that, without the ability to 
efficiently predict Pr (k\p(t)), information theoretic equi- 
libration occurs whenever the outcome variance, 



s {P T (k)}~D- l ^2[Px(k)-D- x ] : 



(2) 



is sufficiently small. Note that, for simplicity of exposi- 
tion, we are focussing on cases where the micro-canonical 
distribution is uniform Pr mc (fc) := 1/D. 

Theorem 1. Consider a system which generates ran- 
dom outcomes chosen from the set {1,...,D} accord- 
ing to an unknown distribution Pr(fc) that is promised 
to have an outcome variance Vfc{Pr(fc)} that scales as 
0(D~ 2 ). Then at least 0{D 1 / i ) samples from the distri- 
bution Pr (k) are needed in order to distinguish it from 
Pr mc (fc) with high probability. 

The proof is provided in the supplemental material. 
The key question is now: which classes of Hamiltonian 



systems satisfy the assumptions of Theorem [T] and hence 
exhibit information theoretic equilibration? 
Generic Hamiltonians. In this section we show that infor- 
mation theoretic equilibration is mathematically generic 
in the sense that it holds for almost all Hamiltonian 
systems, where "almost all" is defined with respect to 
Gaussian Unitary Ensemble (GUE) Q21 HE]. GUE is 
the unique unitarily invariant distribution over Hermi- 
tian matrices H that factorizes into a product of distri- 
butions each over an individual element of H. In par- 
ticular, each clement of H is an i.i.d. Gaussian random 
variable. More details can be found in Appendix |B| 

Our motivation for considering GUE is based on the in- 
sights from decades of experimental and numerical stud- 
ies of complex many-body systems and few-body quan- 
tum chaos models in large Hilbert spaces [TTJ [TBI HI] • The 
well-established random matrix conjecture states that 
certain experimentally relevant features of these complex 
systems, such as the spacings between adjacent eigenval- 
ues, can be modeled statistically by the those of a random 
matrix drawn from an appropriate ensemble sharing the 
same symmetries. 

In practice one often models the feature of interest 
with the ensemble mean, rather than a random sample 
from the ensemble. The feature of the random matrix en- 
semble that justifies this substitution is the phenomenon 
known as concentration of measure, which describes sit- 
uations where the distribution of the property is tightly 
peaked about the ensemble mean. Concentration of mea- 
sure plays a key role in our argument, and it is impor- 
tant for the reader to bear in mind that our averages over 
the ensemble are not an implicit form of decoherence or 
mixing, but an established technique for predicting the 
properties of indvidual systems. Indeed we now show 
that equilibration is generic by demonstrating that the 
concentration of measure phenomena holds even for the 
outcome statistics of maximal observables. 

Theorem 2. Take a non- degenerate observable A act- 
ing on H = C , an initial pure state po = \x)(x\ 
which is an eigenstate of A, and an unknown unitary 



U(t) 



-itH 



where H is drawn uniformly at random 



from GUE. Then there exists a t cq (D) = 0{D~ x l & ) 
s. t. for all t > t cq (D) = 0(Z3~ 1//6 ) the mean, variance, 
and fourth moment of the measurement outcome proba- 
bilities Pv(k\x,t) over GUE are given by: 



E 



GUE 



{Pr(fcM)} 



D + 5 xk O{D) 



D 2 - 1 

V GUE {Pr(fc|x,t)} = 0( J D- 2 ), 
EGUEjPr^lM) 4 } =Q{D- A ). 



0(D- 



(3) 
(4) 
(5) 



For a proof see the supplemental material. This result 
implies that Pr(fc|x,t) is concentrated around 1/D for 
almost every Hamiltonian chosen from the GUE and t > 
t cq (D). In particular, by Chebyshev's inequality, in the 
limit of large D, with high probability over H drawn from 
GUE, the corresponding probabilities Pr(fc|x,i) are at 
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most a constant multiple of 1/D: 



0{D~ 



Pr (|Pr(fc|a;,t)-E GU E{Pr(fc|a;,t)}| > e) < 

(6) 

where fir) is the appropriate measure for the GUE. In 
other words, the probabilities Pr (k\x, f) for an individual 
system are very close to the ensemble average. 

These results imply a concentration of the outcome 
variance. Expanding the outcome and GUE variances 
and using Theorem [2j we find that, for t > t eq (D), 

V GUE {V fc {Pi'(fc|M)}} (7) 
= D- 2 E G u E {Pr (k\x, tf Pr (j\x, t) 2 } + 0(£- 4 ). 

Then using the Cauchy-Schwartz inequality for expecta- 
tions and Theorem [2] again, we have: 
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FIG. 1: The average over x of the RLH ensemble average of 
the probabilities P(x 7^ 0|0, t) of the evolved state plotted as 
a function of time for 5-, . . . , 10-qubit systems. This average 
equilibrates to 1/(D + 1) within an equilibration time that 



^2 E GUE{Pr {k\x, tf Pr (j\x, t) 2 | < 0{D 2 ). (8) scales approximately as 0{l/ s/\og{D)) 
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This proves that V G UE{V fe {Pr (k\x, t)}} = 0(D- 4 ). 
Chebyshev's inequality then implies (in the same fashion 
as ^) that V k {Pi(k\x,t)} G 0(D~ 2 ) with high proba- 
bility over GUE. 

Theorem [l] therefore implies that Pr(fc|:r,i) cannot be 
efficiently distinguished from Pr mc (fc). Hence, in high- 
dimensional Hilbcrt spaces, the vast majority of individ- 
ual Hamiltonian systems (which are at all times in a pure 
state), equilibrate information-theoretically. These re- 
sults imply, rather remarkably, that the dynamical pure- 
state quantum fluctuations for t > t eq become vanish- 
ingly small (in an experimentally relevant sense) in the 
limit of large D. 

Sufficient conditions for Theorems 1 and 2. It is interest- 
ing to note that the very strong assumptions of random- 
ness implicit in GUE (specifically, the Haar randomness 
of the energy eigenstates) are not necessary for Theo- 
rems [T] and [2] to hold, and identifying weaker sufficient 
conditions is desirable. 

As shown in the supplemental material, the object 
that underlies the distribution of Pr(k\x,t) is the uni- 
tary, C, which transforms the eigenbasis of H to that of 
the observable A. Working in the eigenbasis of A, we can 
write U(t) = CF{f)C\ where F(t) = diag[e-" B »], and 
{E a } E =1 are the energy eigenvalues of H. Given an initial 
pure state |x)(x|, the measurement outcome probabilities 
can then be written as 

Pr(fc|sB,t) = Tr[\k)(k\CF(t)C^\x)(x\CF(tfC^] . (9) 

Information theoretic equilibration requires that 
Vfe{Pr (k\x, t)} G 0(D~ 2 ), which in turn requires that 
we know certain properties of Pr (k\x, C, F(t)) . It is 
not difficult to see that Pr (k\x, t) and Pr (k\x, t) can be 
concisely represented by 

Pr (k\x, t) = (L 2 \C® 2 ® C® 2 \R 2 {t)), (10) 

P r (k\x,t) 2 = (U\C® A ®C® A \R A (t)), (II) 



where (L 2 \ = (k,x,x,k\, (L 4 \ = (k k\, and 



|i? 2 (;))=5>^^')|&,6',&,6'), (12) 

b,b' 

\R*(t)}= ^ Eb - E ^ +E ^- E ^\b,b',d,d',b,b',d,d'). 

(13) 



b,b',d,d' 



We refer to a term of the form (L 2 \C® 2 <g) C® 2 |6, 6', b, b') 
in the sum in ( 10 ) as a (2, 2)-term because there are two 
basis change matrices acting on each factor space. The 
analogous terms in ( [TT] ) will be called (4, 4)-terms. 

We see from the above that the strong randomness 
of GUE eigenvectors is not necessary, and that scal- 
ings given in eqns. ([3]),(|4| and ([5| are satisfied provided 
that the matrix elements of C, namely the Hamiltonian 
eigenvector components, satisfy a unitary i-design con- 
dition 25j. This insight has been identified recently in 
the context of subsystem equilibration in Refs. (TDl fTTl 
116) . where one requires eigenvector component reflect- 
ing GUE-randomness only for polynomials of degree at 
most t = 4. For microcanonical equilibration our proof 
technique shows that eigenvector components reflecting 
GUE-randomness for polynomials of degree at most t = 8 
is sufficient to imply that information theoretic equili- 
bration is typical using our Theorems [1] and [2] We now 
provide numerical evidence that such scalings are real- 
ized for typical systems drawn from a more physically 
realistic ensemble consisting of 2-local Hamiltonians. 
Equilibration of Local Hamiltonians. The class of A:-local 
Hamiltonians consists of those Hamiltonians composed 
of at most fc-body interactions (for fixed k) between d- 
level quantum systems. These Hamiltonians are ubiq- 
uitous in condensed matter theory and quantum com- 
puting [26 . We anticipate that such Hamiltonians have 
sufficient complexity to induce equilibration because the 
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FIG. 2: Numerically computed expectation values over 
k and b of the RLH ensemble average and variances of 
the dominant (4,4) term, (x, k, k, x\C 02 <g> C® 2 \b, b, b, 6), 
plotted as a function of D = 2 n , and compared to the 
analytically computed scalings for GUE. The variance of 
the (4,4) terms scale as 3790/D 8 for D > 128. 
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FIG. 3: The measurement outcome variance Vfc{Pr (fc|a;)} 
plotted as a function of system size, both before (2 kicks) 
and after (10 kicks) the equilibration time. For 10 kicks, 
we see excellent agreement with the GUE prediction of 
0(D~ 2 ), confirming information theoretic equilibration 
of a one-body system in the macroscopic limit in a regime 
of global chaos (a = [1.1,1.0,1.0] and t = diag(10,0, 1)). 



interactions they are composed of generate a universal 
gate set for quantum computing. Although difficult to 
prove analytically, we provide numerical evidence that al- 
most all randomly chosen 2-local Hamiltonians acting on 
qubits equilibrate information theoretically with respect 
to the observable A = <j\ ® • • • ® c™ (corresponding to 
a non-degenerate projective measurement in the compu- 
tational basis). This describes a wide-class of global ob- 
servables because the ensemble of random 2-local Hamil- 
tonians is invariant with respect to independent rotations 
of the individual qubits. 

We construct the ensemble of random 2-local Hamil- 
tonians on n systems (denoted RLH) as follows: 

n 

^EE^+EEE^-^ a*) 

i—l p i^j p p' 

where p, p' € {X, Y, Z}, and each ai and bi : j is a Gaussian 
random variable with mean and variance 1. 

Fig. [I] shows that systems evolving under RLH Hamil- 
tonians approach equilibrium as D increases. The equi- 
libration time can be difficult to define, but for concrete- 
ness we estimate it using the location of the inflection 
points of the curves in Fig. [I] These times scale as 
0(l/^/log(D)), similar to what would be expected from 
analysis of the Ehrcnfcst brcaktime for quantum chaotic 
systems [171 ^ |2"5 ] . Fig. [2] further shows that the RLH 
average of the (4, 4)-terms agrees with the GUE predic- 
tions of 0(D~ 4 ) scaling. We also find that the variance of 
the (4,4) terms is 0(D~ 8 ) suggesting that concentration 
of measure holds for these. This implies that similarly 
to the GUE case, V fc {Pr (k\x, t)} e 0{D~ 2 ) for an in- 
dividual system, and hence the conditions of Theorem [I] 
are met for almost all H taken from the RLH provided 
that t > ieq(-D) = 0(l/Vk>g(L>)). This is strong evi- 
dence that generic 2-local Hamiltonians exhibit informa- 
tion theoretic equilibration. 

Equilibration for Individual Systems. Information theo- 
retic equilibration of pure states can occur also for an 



individual system consisting of a one-body dynamical 
model associated with global classical chaos. A variant 
of the quantum kicked top, exhibits this property. It is 
described by the Floquet map (with h = 1) [TTj : 

Up = e~ ijTjT e- i3a , (15) 

where J = [J x , J y , J z ] is a vector of angular momentum 
operators, r is the moment of inertia tensor, which is di- 
agonal with entries [t xx , r yy , t zz ] and a — [cv x , cv y , a z ] is a 
vector of kick-strengths. The dimension of the system is 
D = 2j + 1 where j is the total angular momentum quan- 
tum number. We see in Fig. [3] that for a set of chaotic 
parameters for A and r, roughly 10 kicks (applications of 
Up) are needed in order for Vfe{Pr (k\x, t)} = 0(D~ 2 ), 
and hence for Theorem [T] to apply. 

Discussion. We have shown analytically that information 
theoretic equilibration holds generically for the proba- 
bility distributions of maximal measurements of closed 
Hamiltonian systems that satisfy a condition of sufficient 
complexity. Our condition of sufficient complexity is a 
natural extension of the random matrix conjecture |27j to 
low-degree polynomials of the eigenvector components of 
complex quantum systems [25J [25] . We have also demon- 
strated numerically that this condition is satisfied by a 
class of random 2-local (many-body) Hamiltonians, as 
well as an individual single-body dynamical system in a 
regime of global classical chaos. Remarkably, information 
theoretic equilibration is observed to hold for pure quan- 
tum states for each of these systems, without requiring 
any form of decoherence or restricting to coarse-grained 
measurements . 

In the context of random matrix models we showed 
that this weak form of equilibration holds for almost all 
Hamiltonians for any fixed observable. In the context of a 
fixed Hamiltonian model, one can ask: for which observ- 
ables is this form of equilibration expected to hold? The 
answer is already clear: almost all of them, because the 
relevant feature is the relative randomness of the basis 
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transformation from the energy eigenbasis to the observ- 
able's eigenbasis, or equivalently, the matrix of energy 
eigenvector components in the basis of the observable. 

Our key insight is that although dynamical pure states 
exhibit coherent fluctuations away from true micro- 
canonical equilibrium, these fluctuations remain small 
and appear pseudo-random, and hence their detection re- 
quires extraordinary experimental resources, such as col- 
lecting O(Z) 1 / 4 ) measurement outcomes from repetitions 
of the exact experimental conditions, or pre-computation 
of the location of the dynamical state in a D-dimensional 
Hilbert space, or performing joint (entangling) measure- 
ments on identical copies of the system. In the absence of 



such resources, by Theorem[T]we see that after some finite 
time (which depends on the details of the energy spec- 
trum) the probability distributions for dynamical pure 
states cannot be distinguished from the micro-canonical 
distribution. 
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Appendix A: Proof of Theorem 1 

Proof of Theorem^ First, notice that the lack of knowl- 
edge of the form of the distribution Pr (k) implies that 
the labels of the particular outputs k are of no signifi- 
cance. This means that we can only use the frequency of 
coincidences in the output in order to distinguish Pr (k) 
from the uniform distribution. 

It is straightforward to see that 0(VT>) observations 
are needed to be able to obtain a pair of identical mea- 
surement outcomes drawn from the uniform distribution. 
We compare this to a distribution with Vfe{Pr(fc)} = 
0(D~ 2 ) that has the highest probability of coincidence 
for some outputs. We achieve this by imposing a restric- 
tion on Pr (fc) that is weaker than Vfc{Pr(fc)} = 0(D~ 2 ) 
but is also implied by the variance constraint. Using the 
the definition of variance, V^{Pr (k)} = D^ 1 J2k(^ r (^) — 
D^ 1 ) 2 , and expanding the sum and using J2k ^ r CO = 1; 
we find 

V fe {Pr (fc)} = 0(D- 2 ) Pr (k) < 0(ZU 1//2 ), V k. 

( A1 ) 

We will now drop the variance restriction and find a 
distribution Q(k) that only satisfies Q(fc) < 0(D^ 1 / 2 ) 
and also maximizes the probability of coincidence. This 
distribution is, for some constant a, given by 

Q(k) = \VD ■ V a . (A2) 
(0 : else 
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Given the distribution Q, the probability that there are 
no coincidental outcomes after N samples is given by: 



— 1 1 



N 



= 1 



N(N - 1) 



0{N/D). 



D - N + 1 

/ \ v " / 

(A3) 

Therefore at least N = 0{D 1 / i ) samples are needed 
to conclude with high probability that Q{k) is not uni- 
form if Q(fc) < OiD- 1 / 2 ). This implies that a distribu- 
tion Pr (k) with V fc {Pr (k)} = 0(D- 2 ) cannot be distin- 
guished from the uniform distribution with high proba- 
bility unless N > 0{D 1 / i ) 1 as claimed. □ 



Appendix B: Proof of Theorem 2 



We prove Theorem [2] in two steps: 
1. We derive expressions for the Haar expectations of 



Pr(k\x,C,F(t)) (see |B2]) below) and the second 
and fourth moments. See Lemmas [TJ [2j and[3j 

2. We find the expectations of these expressions over 
the GUE eigenvalue distribution, and show that 
there exists a time t cq (D) = 0{D~ X / 6 ) such that 
for all t > t cq (D), eqns. @, and ^ hold. See 
Section |B~2| for these expectations. 



Since the calculations in this subsection require in depth 
knowledge of the properties of the Gaussian Unitary En- 
semble (GUE), we will begin by giving a brief review of 
its properties [TTJ [TS1 [30] ■ A Hermitian matrix H gen- 
erated according to the GUE has diagonal elements h aa 
that are real valued random variables each with distri- 
bution Af(0,a 2 ), and off-diagonal elements h a b with real 
and imaginary parts that are random variables each with 
distribution A/"(0, |cr 2 ). The variance a 2 is a free param- 
eter, which is closely related to the expected maximum 
energy eigenvalue as well as the ensemble average energy 
level spacing. 

There are several important drawbacks to using GUE 
which deserve mention before moving on. First, the av- 
erage level density, which takes the form of a semi-circle, 
is not a good model for physically relevant Hamiltonian 
systems, even chaotic ones. In particular, for most natu- 
ral systems, such as those with only two particle interac- 
tions, the norm of the Hamiltonian scales polynomially 
with the number of particles [8]. However, the expected 
norm of a GUE Hamiltonian scales polynomially in the 
Hilbert space dimension D [17] . As we will see below, 
this has a large impact on what might be called the equi- 
libration time for these dynamical systems, which should 
therefore be taken with a grain of salt. Because of this, 
and in order to simplify calculations, we will follow the 
standard practice [Til EH] 01 taking the variance a 2 = \. 

The joint distribution over all elements of H factor- 
izes into a product of a distribution over eigenvectors, 
and a distribution over the energy eigenvalues of H (see 



[T5] Theorem 3.3.1, or [T7] Chapter 4). Further, the 
joint probability distribution over eigenvectors of H is the 
same as that over the change of basis matrix C, namely 
the Haar measure on the unitary group U(D) [TTl ITS] . 
Letting C be the unitary which takes the eigenbasis of H 
to that of A, and working in the eigenbasis of A, we can 
write 



U[t) = CF(t)C\ 



(Bl) 



where F(t) :— d\a,g[e~ ltEa ], and {E a }^ =1 are the energy 
eigenvalues of H. We can therefore take separate expec- 
tations over eigenvectors and eigenvalues, namely, over 
the matrices C and F. Pr (k\x, t) can be expressed as 

Pr (k\x,C,F(t)) =Tr[|fc)(fc|CF(t)C t |x)(x|CF(t) t C t ] . 

(B2) 

In the following we will write the expectation over the 
Haar measure on the unitary group U(Z?) of change of 
basis matrices C, as Ec{-}, and E spcc {-} for the expec- 
tation over the GUE eigenvalue distribution. 



1. Expectations over eigenvectors 

Lemma 1. Take a non- degenerate observable A acting 
on H = C , an initial pure state po — \x)(x\ which is 
an eigenstate of A, and a unitary U(t) — e~ ltH where 
H is drawn uniformly at random from GUE. Then the 
variance of the measurement outcome probabilities over 
the Haar measure on the unitary group U(-D) of change 
of basis matrices C is given by: 

V c {Pr(k\x,C,F(t))} 



D 4 i 



i){ { D 2 -2D + A + (7-2D)S xk 
+ \pL(t)\ 2 (2D5 xk -l05 xk ~2) 
+ S xk \p(2t)\ 2 + 2S xk Re[fi(t) 2 p(~2t)]} 



0(D' 5 ), 
(B3) 



where we have defined fi(t) — Tr[U(t)] — Tr[_F(i)] (this is 
often called the spectral form factor 1 19)). 

Proof. First, note that the squares of the outcome prob- 
abilities Pr (k\x, C, F(t)) can be written in the form: 

Ti[\k)(k\CF(t)C^\x)(x\CF(t^C^] 2 



i\k){k\CF(t)C*\x){x\CF(t)iCft\s) 



x {s'\k){k\CF{t)C^\x){x\CF{tytf\s') 
= (L 4 \C® 4 ®C® 4 \R 4 {t)), (B4) 

where C is the complex conjugate of C and 
\Ri(t))= J2 e i4 ^-^'+^-^') 



b,b',d,d' 

x \b,b',d,d',b,b',d,d'), 



(B5) 



7 



The expectation of the expression C® <g) C® over Haar 
measure can be written as the projector onto the sub- 
space spanned by the vectors 

= (T4 <8 1) |0>l,5|^2,6|^3,7|0>4,8, (B6) 

where \(j>)ij — 53a=i \ a )i\ a )ji an( i the index tt runs over 
the 4! permutations of the elements {1,2,3,4}, and 
is the unitary permutting the first four factor spaces ac- 
cording to tt. It was shown in [TU] that this projector is 
given by 



7T,(T 



(M-^J^X^I, (B7) 



where the matrix M has components M^^ = ly^Va) = 
TrfV^,— i V a \ — d 1 ^ 1(J \ and 1(a) is number of cycles in 
the cycle decomposition of the permutation 7r _1 o\ We 
then have 

E c {Pr (k\x,C,F(t)) 2 } 

= J2( L i\^)(M-%,A^a\Ri{t)), (B8) 

7T,<T 

where the inner products (Q^lR^t)) are given by: 







($(1,2,3,4) 


R 4 (t)) 


= \m\ 


$(1,2,4,3) 




($(1,3,2,4) 


R 4 (t)) 


= d\p(t)\ 2 , 


$(2,1,3,4) 


R 4 (t)) = 


($(4,2,3,1) 


R 4 (t)) 


= dW)\ 2 , 


$(1,3,4,2) 


Ri(t)) = 


($(1,4,2,3) 


R 4 (t)) 


= W)\\ 


$(2,3,1,4) 


Ri(t)) = 


($(2,4,3,1) 


R 4 (t)) 


= \m 2 , 


$(3,1,2,4) 


Ri(t)) = 


($(3,2,4,1) 


R 4 (t)) 


= \m\ 


$(4,1,3,2) 


Ri(t)) = 


($(4,2,1,3) 


R 4 (t)) 


= W)\\ 






($(3,4,1,2) 


R 4 (t)) 


= \^t)\\ 






($(1,4,3,2) 


R 4 (t)) 


= p(t) 2 p(2t), 






($(3,2,1,4) 


R 4 (t)) 


= p(2t)p(t)\ 


$(2,1,4,3) 


Ri(t)) = 


($(4,3,2,1) 


R 4 {t)) 


= d 2 , 


$(2,3,4,1) 


M*)) = 


($(3,4,2,1) 


R 4 (t)) 


= d, 


$(2,4,1,3) 


R 4 (t)) = 


($(3,1,4,2) 


R 4 (t)) 


= d, 


$(4,1,2,3) 


R 4 (t)) = 


($(4,3,1,2) 


R 4 (t)) 


= d, 



Further, (i 4 |$ w ) = 1 for tt = (4,3,2,1), (4,1,2,3), 
(2, 3, 4, 1), (2, 1, 4, 3), and (L 4 |$ 7r ) = 6 lk for all other tt. 
Taking the sum of the above terms with M~ l in eqn. 



E 



(|B8j), we find: 
c{Pr(k\x,C,F(t)) 2 } 

= ~{\p J (t)\ 4 ((D 2 -D-2)6 xk + 2) 
+ \p(t)\ 2 (-AD 2 -12D- 8+ (4L> 3 - 
+ (D 2 - D - 2)8 xk (\p(2t)\ 2 + p(2t)p(t) 2 + ii{2t)m 2 
+ 2D 4 + 8D 3 + 6D 2 - (4L> 3 + l2D 2 )5 xk 

+ 2\p(2t)\ 2 + 2ji(2t) h i(t) 2 + 2fi{2t)fi(t) 2 Y (B9) 



-SD 2 +4D 

\2 



8)8 xk 



where a = D 2 (D - l)(D + 1)(D + 2)(D + 3). Using 
E c {Pr (k\x, C, F(t))} from Lemma [2] we then have 

Y c {Pt (k\x,C,F(t))} 

= ^{d 2 -2D + A+(7-2D)S xk 

+ \p(t)\ 2 (2D6 xk ~lQ6 xk -2) 

+ SM2t)\ 2 + 2S xk Re[p J (t) 2 ^(-2t)]} + 0{D- 5 ). 

(BIO) 

□ 

Lemma 2. Take a non- degenerate observable A acting 
on H = C D . an initial pure state po = \x)(x\ which is 
an eigenstate of A, and a unitary U(t) — e~ ltH where H 
is drawn uniformly at random from GUE. Then the ex- 
pectation of the measurement outcome probabilities for C 
distributed according to the Haar measure on the unitary 
group V(D) is given by: 



E c {Pr(k\x,C,F(t))} = 



D - 6 xk + \p(t)\ 2 (6 xk - ±) 



D 2 - 1 



(Bll) 



Proof. This expectation can be calculated in a similar but 
simpler fashion as the variance in the previous lemma, so 
we leave the proof as an exercise. □ 



Lemma 3. Take a non- degenerate observable A acting 
on H = C D , an initial pure state po — \x)(x\ which is 
an eigenstate of A, and a unitary U(t) = e~ ltH where 
H is drawn uniformly at random from the GUE. Then 
the fourth moment of the measurement outcome probabil- 
ities over the Haar measure on the unitary group U(D) 
of change of basis matrices C is given by: 



E 



c{vr(k\x,C,F(t)f} 

^ TjsE^I^^+E^^I^W)' (B12) 



-9\ 



where B is a matrix with components < 0(D 

Proof. The fourth power of the outcome probabilities can 
be written in the form: 



(B13) 



Tr[\k)(k\CF(t)C^\x)(x\CF{tyC^] 4 
= (L 8 \C® 8 ®C® s \R s (t)), 



where (L 8 | and \Rs(t)} are defined analogously to ( |B5[ ), 
but with twice the number of tensor factors. Further, 
the average of the expression C® 8 (g> C 1 ® 8 over Haar mea- 
sure can be written as the projector onto the subspace 
spanned by the vectors 

1*0 = (V n «» 1) |0)i, 9 |0) 2 ,io ■ • . |^)8,i6, (B14) 

where the index tt runs over the 8! permutations of the 
elements {1,2, ... ,8}, and V„ is the unitary permuting 
the first eight factor spaces according to tt. 



We will now determine the asymptotic scaling of the 
following expression with D: 

E c {Pr(fc|x,C,F(i)) 4 } 

= Y,( L s\®*)t M ~%A®°\Mt)), (B15) 

by finding an approximation for M^ 1 . Recall that 
the matrix M has components M„ = (V^|V^) = 
Tr[K_iVy = D l ^~ 1<T \ where 1(a) is number of cycles 
in the cycle decomposition of the permutation 7r~ 1 a. It 
is clear that the diagonal components of M are all equal 
to D s , and all other components of M are strictly < Z? 8 , 
as the identity permutation is the only one with 8 cycles. 
Letting A := 1 - M/D 8 , we have 
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(t-A) = t-A 



N+l 



which converges to 1 in the limit N — 
1 1 -A 1 1 p < 1. Because A is of fixed size 
elements are < 1/D, we have 
large. Therefore, (1 — A)^ 1 
implies that 

1 



(B16) 

oo if and only if 
V. x 8! and all its 
o P < 1 for D sufficiently 
1 + A + 0(D- 2 ), which 



Ar 1 = 



D 8 



0(D~ 9 ). 



(B17) 



Next, it is not difficult to see that all components of 
(Lgj^Tr) are equal to 1 or 8 x f.. Therefore, we can split the 



sum m 



as 



E c {Pr{k\x,C\F{t)f} 



Y.B^A^lMt)), (B18) 



where we have used that (Lgl'Iv) < 1 for all ir, and B = 
M _1 - -A, has all components < 0(D~ 9 ). □ 



2. Expectations over GUE eigenvalues 

Our next step towards proving Theorem [2] is to find 
the GUE average of the the spectral form factor nit), 
which appears in Lemmas [I] and [2] The joint distribu- 
tion over the un-ordered energy eigenvalues {E a } E =1 of a 
GUE matrix is given by (see [IS] Theorem 3.3.1, or [T7] 
Chapter 4): 



D 



l=a<b 



(B19) 
Integration of 



where C D = (2it) d I 2 g d2 ]J 

P({E a } D ) over D — m variables gives the m-point corre- 
lation function ([TS] 6.1.1) 



Hi 



/OO 
P{{E a } D )dx m+l ■ ■ ■ dx D , 
-OO 

(B20) 



where \ = D\/(D — m)\. By [TS] Theorem 5.1.4, this can 
be written as: 



R m (Ei, ■ ■ ■ ,E m ) — dct[K D (Ei, Ej)] i: j-i t 



(B21) 



where K D (E U E 3 ) = J2k=a <f>k(Ei)(t>k(Ej) , and the cf> k (x) 
are the harmonic oscillator wave-functions 4>j{x) = 

(2^!^)" 1/2 e- x2 / 2 Hj(x), with Hj(x) the Hermite poly- 
nomials (see [18] section 6.2). 



Calculation o/E spcc {|/i(t)| 2 } 



Our next goal is to evaluate the expression T(t,D) 



E 



>:>ec{>(*)| 2 } 



E 



spec 



the trace and grouping terms, we have 
r(i,£)=E spcc {|Tr[e-^]| 2 } 



{|Tr[e-^]| 2 }. Expanding out 



D 

E« 

i=i 



,UE, 



P({E a } D )l[dE j 



3 = 1 



/oo D 
P({E a } D )l[dE j 

3 = 1 

r oo D D 
J -°°l =j tm j=l 

(B22) 

Using the invariance of the joint distribution P({E a } D ) 
under permutations of the energies, and the definition of 
the 2-point correlation function, this becomes 

/oo 
e lt( - E2 - El) R 2 (E 1 ,E 2 )dE 1 dE 2 . (B23) 
-oo 

It is well known that for large D the function Ku(E,E) 
follows the so called Wigner semi-circle law [TH] (with 



1/2): 



K D (E,E) ~ -^2D-E 2 Q(V2D- \E\), 



(B24) 



where Q(x) is the Heavyside step function. For large D 
we then have: 



r(t,D) = D + 2D 



Ji 



U/2Dt\ 

~~t 2 



-(V2D-t/2)G(2V2D-t), (B25) 

where Ji(x) is the first Bessel function of the first kind. 
It should also be noted that, the limit of the function 
K D (Ei, E 2 ) is generally taken by rescaling (often called 
'unfolding' - see [23] section III.A.l) the energy level den- 
sity as well as the energies by the local mean spacing 
[17l ITS] . In taking this limit and the integrals for T (t, D) 
we have followed the the procedure of [TH] Appendices 
10, and 11. 
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b. Equilibration time 

Now that we have the GUE expectation of the spectral 
form factor, we can use Lemma [2j to find the full expec- 
tation over eigenvectors and eigenvalues of the outcome 
probabilities: 



E. 



S pcc,c{Pr(fc|x,C,.F(*))} 



D-±r(t,D) + 6 xk (T(t,D)-l) 



D 2 - 1 

Notice that if T(t, D) = 0(D), then 

D + S xk O(D) 



(B26) 



E. 



spcc,c{Pr(fc|x,C,^(i))} = 



D 2 - 1 



0(D~ 



(B27) 

In particular, if D is large, then the GUE expectation of 
the probability distribution from ( B27 ) is essentially the 



uniform distribution. We therefore take the equilibration 
time t eq (D) to be defined by the condition 



t eq (D) := {T | T(t,D) = 0(D) i yt > T}. 



(B28) 



It is clear that the second term in (|B25|) satisfies: 



lim 2D- 



'2Dt 



t 2 



0. 



(B29) 



This shows that there exists a finite time t eq (D) such 
that T(t, D) = 0(D) for all t > t cq {D). 

In order to get a sense of the equilibration time (keep- 
ing in mind our previous comments on GUE energy spec- 
tra), note that the condition on the equilibration time 
(B28) is essentially that 



2D- 



Ji V2Dt 



t 2 



0(D). 



(B30) 



Using the fact that for x 3> 3/4 we can approximate 
Ji(x) ~ y / 2/'7nECOs[2; — 37r/4], it the follows that the 
equilibration time is 



t eq (D) = Op" 1 / 6 ) 



(B31) 



c. GUE expectation of first and second power 

Our next step towards a proof of Theorem [2] involves 
evaluating the expectation values over the spectrum that 
are needed to prove eqns. ([3]), and Q. First, from the 
previous section we have that there exists a finite time 
t cq (D) such that r (*,£>) = 0(D) for all t > t cq (D). 
Using Lemma [2j and the above spectral expectation, we 
have 

E spec , c {Pr(k\x,C,F(t))} = D+ ^ 2 xk0 ^ D) +0(D- 2 ), 

(B32) 



which proves that for all t > t cq (D), eqn. [3] holds. 
Next, Lemma [l] implies that 

E spcc {Y c {Pr(k\x,C,F(t))}} 

= j^{d 2 -2D + 4 + (7- 2D)S xk 

+ T(t,D) (2D6 xk - 105 xk - 2) + 6 xk T (2t , D) 

+ 26 xk E spcc {Re[ti(t) 2 fx(-2t)}} } + 0(Z?- 5 ). 

(B33) 

In the next section we will show that for t > t cq (D), 
E spcc {Re[n(t) 2 n(-2t)}} = O(D). Using this along with 
r (*,£>) = O(D) in the above, we find 

E sp ec{V c {Pr (k\x, C, F(t))}} - 0(D- 2 ), (B34) 

which proves that for all t > t cq (D), eqn. [4] holds. 

d. Bounding E, Bpec {[j,(t) 2 fj,(—2t)} 

In order to simplify the derivation of our upper bounds, 
we define 

A(t,D) := E spec {n(t) 2 n(-2t)} 

/OO -P 
^ e «(B.+^-2B fc ) < P ({E a } D )*[[dE u 
-OO ■ ■ ,, v_t 



1=1 



(B35) 



We can expand the triple sum into three parts depend- 
ing on whether i,j,k are: (i) all distinct, (ii) only two 
equal, or (iii) all equal, which upon integration over the 
remaining energies, give terms of the form: 

(i) R 3 (E 1 ,E 2 ,E 3 )e it ^+ E -- 2E ^, 

(ii) R 2 (E 1 ,E 2 )e 2U ^- E ^ +2R 2 (E 1 ,E 2 )e tt ^- E ^, 

(iii) D. 

By expanding i?3 and R 2 , we find products of inte- 
grals of terms of the form Kr>(Ei 1 Ei)e ltiEi , as well as 
K D (E i ,E j ) 2 e i( - tiEi+t i E i\ and 

K D (E 1 ,E 2 )K D (E 2 ,E 3 )K D (E 3 ,E 1 )e^ El+E "- 2E3 l 

(B36) 

We will bound the magnitude of these terms in a similar 
fashion as was done in [TT]. Note that the integral of 
(B36) is just 



Tr[Pe tEl Pe lE2 Pe~ 2lE3 ] , 



(B37) 



where P = ^2 k =Q l^fcX^fcl ^ s * ne projector onto the D- 
dimensional lower-energy subspace spanned by the har- 
monic oscillator wave- functions, and X is the position 
operator. Using the Cauchy-Schwartz inequality twice 
on JB37], we find: 



\T T [(Pe lEl Pe lE2 )Pe 



■ A/VTr[P]Tr[P]Tr[P] 
< D. (B38) 
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A similar argument also shows that terms with integrands 
of the form Kjj(Ei, Ej) 2 e l ^ tiEi+tjE ^ are also bounded by 
D. Using these bounds, we then have 

\A(t,D)\ < 6D 

+ K D (E,Ey tE ^j (J°° K D {E 1 E)e- 2UE ^j 

+ D ^J°° K D {E, E)e- 2UE ^j + 2D (J°° K D (E, E)e ltE ^j 



/OO ^ P oo 

K D (E,E)e 2ltE +2 K D (E,E)e ltE 
-oo J —oo 



Next recall that 



Jx(V2D t) 



f°° K D {E,E)e UE = V2D'^^ 

J — oo t 



(B39) 



(B40) 



which approaches as t — > oo. The equilibration condi- 
tion |B28} requires that \/2DJ 1 {y/2D t)/t = 0(\/Z)) for 
t > t eq (D), and if this is satisfied, then for all t > t oq , we 
have \A(t,D)\ < 0(D). 



e. GUE expectation of fourth power 

The final quantity that we need to prove Theorem [2] is 
the asymptotic scaling of the GUE average of the fourth 
moment of Pr (k\x). This quantity shows a concentration 
of measure for the outcome variance of Pr(/c|a;), which 
will allow us to conclude that individual Hamiltonians 
drawn from the GUE will information theoretically equi- 
librate with high probability. Specifically, we show that 
for t > t cq (D) we have that E speCjC jPr (k\x, C, F(i)) 4 | = 

0(D- 4 ). Recalling the form of E c {Pr (k\x, C, F(i)) 4 | 

from Lemma [3j we see that it is sufficient to show that 
for t > t cq (D), we have E spcc {($ ff \R 8 (t))} < 0(D 4 ), for 
all permutations a. 

Calculating some explicit examples of 

_ e it(E a -E a ,+E b -E b ,+E c -E c ,+E d -E d ,) 

a, a , 
c,c 

x (a, a', b, b' ', c, c', d, d'\V a \a, a' , b, 6', c, c', d, d'), 

(B41) 



we see that these are of the general form 



D^(/i*Msi*)V--,M/4tMs4*) 4 



where fj,gj € {0, 1, 2, 3, 4}, and 



(B42) 



(B43) 



4 

i=1 j=1 ■ , \ 

and if fj = then the corresponding n(fjt) does not 
appear in the product (and similarly for gj). For exam- 
ple, there is a |/i(i)| 4 term arising from a = 1, and a .D 4 
term arising from er = (12) (34) (56) (78). The expecta- 
tions E spcc {D a n(f 1 t)fi(g 1 t)*, . . . /x(/ 4 t)^(g 4 t)*} can t hen 
be bounded in a similar fashion to Appendix |B 2 d In 
particular, we can expand the sums in the product of 
the terms n(fjt) = Tr[e l ^>' t - H ] into various parts depend- 
ing on which indices are equal or not equal, just as was 
done for expression ( B35 1 . These will then give a con- 
stant D b factor, and various combinations of integrals 
of m-point correlation functions. Each of the integrals 
with 2-point or higher order correlation functions can be 
bounded by D, just as was done for eqn. (B36|. We will 



then be left with various powers of integrals of the form 
J^ao K d(E , E)e ltE which as we have seen approach as 
t — > oo, and so are irrelevant for the t > t eq regime. The 
only remaining question then is power of the constant D b 
factor for each term. It is not difficult to see that a power 
of D arises from each pairing fi(fjt)fi(gkt)* with fj = g^. 
For example, the expectation of the term |/z(£)| 4 gives a 
contribution of D 2 (this is in fact the infinite time limit). 
From the constraint (B43), it is not difficult to see that 
D 4 is the highest power of D which can arise. This shows 
that for t > t ea 



E spec , c {Pr(k\i,C,F(t)) 4 } = Op- 



Putting all of the above together, we have: 



(B44) 



Proof of Theorem [1[ Theorem [2] follows directly 
from |B32] ), jB34| an d (|B44| for t > t cq (D) = 0{D^ 1 / & ), 
which was shown in ( |B31[ ). □ 



